Add support for HTML indexes #719

charliermarsh · 2023-12-24T15:24:35Z

Summary

This PR adds support for HTML index responses (as with --index-url=https://download.pytorch.org/whl).

Closes #412.

zanieb · 2023-12-24T16:43:52Z

Why the rush? :p

charliermarsh · 2023-12-24T17:03:22Z

I didn’t assume anyone would be reviewing anything for the next week and I don’t want PRs to sit, so erring on the side of merging. If anyone wants to review they are welcome and I will fix in follow-up commits.

konstin

nice work!

konstin · 2023-12-26T13:01:17Z

crates/puffin-client/src/error.rs

+    #[error("Invalid `Content-Type` header for {0}")]
+    InvalidContentTypeHeader(Url, #[source] http::header::ToStrError),
+
+    #[error("Unsupported `Content-Type` \"{1}\" for {0}")]


This should tell the user which content types we support

konstin · 2023-12-26T13:02:54Z

crates/puffin-client/src/html.rs

+    Utf8(#[from] std::str::Utf8Error),
+
+    #[error(transparent)]
+    UrlParse(#[from] url::ParseError),


This should also show the input string, url::ParseError is a value-less enum

konstin · 2023-12-26T13:06:25Z

crates/puffin-client/src/html.rs

+
+/// Parse the list of [`File`]s from the simple HTML page returned by the given URL.
+pub(crate) fn parse_simple(text: &str, base: &Url) -> Result<Vec<File>, Error> {
+    let dom = tl::parse(text, tl::ParserOptions::default()).unwrap();


This needs an unwrap-safety comment

konstin · 2023-12-26T13:20:35Z

crates/puffin-client/src/html.rs

+    // Extract the hash, which should be in the fragment.
+    let hashes = url
+        .fragment()
+        .map(|fragment| parse_hash(fragment, &url))
+        .transpose()?
+        .ok_or_else(|| Error::MissingHash(url.clone()))?;


I don't think we can make hashes mandatory, in PEP 503 hashes are a SHOULD.

Agreed, but they’re currently required everywhere else.

konstin · 2023-12-26T13:31:37Z

crates/puffin-client/src/registry_client.rs

+    /// Return the `Accept` header value for all supported media types.
+    #[inline]
+    const fn accepts() -> &'static str {
+        "application/vnd.pypi.simple.v1+json, application/vnd.pypi.simple.v1+html;q=0.2, text/html"


i'd q=0.2 the text/html too

Added in the relative URLs PR but I’ll carve it out into a separate change if that doesn’t merge soon.

charliermarsh · 2023-12-26T14:05:06Z

Thanks for the nice review!

See: #719

charliermarsh added the enhancement New feature or improvement to existing functionality label Dec 24, 2023

charliermarsh force-pushed the charlie/html branch 4 times, most recently from fc8acef to f3afff9 Compare December 24, 2023 15:50

charliermarsh enabled auto-merge (squash) December 24, 2023 15:50

charliermarsh force-pushed the charlie/html branch from f3afff9 to 49240e7 Compare December 24, 2023 15:58

Add support for HTML indexes

49240e7

charliermarsh merged commit 5bce699 into main Dec 24, 2023

charliermarsh deleted the charlie/html branch December 24, 2023 16:04

konstin reviewed Dec 26, 2023

View reviewed changes

charliermarsh mentioned this pull request Dec 26, 2023

Review feedback for HTML indexes #733

Merged

charliermarsh added a commit that referenced this pull request Dec 26, 2023

Review feedback for HTML indexes (#733)

ae83a74

See: #719

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for HTML indexes #719

Add support for HTML indexes #719

Uh oh!

charliermarsh commented Dec 24, 2023

Uh oh!

zanieb commented Dec 24, 2023

Uh oh!

charliermarsh commented Dec 24, 2023

Uh oh!

konstin left a comment

Uh oh!

konstin Dec 26, 2023

Uh oh!

konstin Dec 26, 2023

Uh oh!

konstin Dec 26, 2023

Uh oh!

konstin Dec 26, 2023

Uh oh!

charliermarsh Dec 26, 2023

Uh oh!

konstin Dec 26, 2023

Uh oh!

charliermarsh Dec 26, 2023

Uh oh!

charliermarsh commented Dec 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add support for HTML indexes #719

Add support for HTML indexes #719

Uh oh!

Conversation

charliermarsh commented Dec 24, 2023

Summary

Uh oh!

zanieb commented Dec 24, 2023

Uh oh!

charliermarsh commented Dec 24, 2023

Uh oh!

konstin left a comment

Choose a reason for hiding this comment

Uh oh!

konstin Dec 26, 2023

Choose a reason for hiding this comment

Uh oh!

konstin Dec 26, 2023

Choose a reason for hiding this comment

Uh oh!

konstin Dec 26, 2023

Choose a reason for hiding this comment

Uh oh!

konstin Dec 26, 2023

Choose a reason for hiding this comment

Uh oh!

charliermarsh Dec 26, 2023

Choose a reason for hiding this comment

Uh oh!

konstin Dec 26, 2023

Choose a reason for hiding this comment

Uh oh!

charliermarsh Dec 26, 2023

Choose a reason for hiding this comment

Uh oh!

charliermarsh commented Dec 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants